SWASH: A Naive Bayes Classifier for Tweet Sentiment Identification

نویسندگان

  • Ruth Talbot
  • Chloe Acheampong
  • Richard Wicentowski
چکیده

This paper describes a sentiment classification system designed for SemEval-2015, Task 10, Subtask B. The system employs a constrained, supervised text categorization approach. Firstly, since thorough preprocessing of tweet data was shown to be effective in previous SemEval sentiment classification tasks, various preprocessessing steps were introduced to enhance the quality of lexical information. Secondly, a Naive Bayes classifier is used to detect tweet sentiment. The classifier is trained only on the training data provided by the task organizers. The system makes use of external human-generated lists of positive and negative words at several steps throughout classification. The system produced an overall F-score of 59.26 on the official test set.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

GTI at SemEval-2016 Task 4: Training a Naive Bayes Classifier using Features of an Unsupervised System

This paper presents the approach of the GTI Research Group to SemEval-2016 task 4 on Sentiment Analysis in Twitter, or more specifically, subtasks A (Message Polarity Classification), B (Tweet classification according to a two-point scale) and D (Tweet quantification according to a two-point scale). We followed a supervised approach based on the extraction of features by a dependency parsing-ba...

متن کامل

An Empirical Study on Machine Learning for Tweet Sentiment Analysis

Tweet sentiment analysis has been an effective and valuable technique in the sentiment analysis domain. As the most widely used approach for tweet sentiment analysis, machine learning algorithms work well on the sentiment classification, just as they have been successfully applied for many other purposes. In this thesis, we conduct a systematic and thorough empirical study on the machine learni...

متن کامل

I act , therefore I judge : Network sentiment dynamics based on user activity change Supplemental Material

We annotate individual posts following the approach in [1], [2]. Tweets are grouped by topic based on included topic hashtags. For example, tweets relating to the topic of president Barack Obama contain the hashtag #obama within them. The topic-related tweets often contain other hashtags which we assign a preliminary sentiment probability (positive and negative) using the Multinomial Naive Baye...

متن کامل

Sentiment Analysis for Social Media

The proposed system is able to collect useful information from the twitter website and efficiently perform sentiment analysis of tweets regarding the Smart phone war. The system uses efficient scoring system for predicting the user’s age. The user ‘gender is predicted using a well trained Naïve Bayes Classifier. Sentiment Classifier Model labels the tweet with a sentiment. This helps in compreh...

متن کامل

Twitter Sentiment Analysis: Lexicon Method, Machine Learning Method and Their Combination

This paper presents a step-by-step methodology for Twitter sentiment analysis. Two approaches are tested to measure variations in the public opinion about retail brands. The first, a lexicon-based method, uses a dictionary of words with assigned to them semantic scores to calculate a final polarity of a tweet, and incorporates part of speech tagging. The second, machine learning approach, tackl...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015